27 research outputs found

    Large Scale Question Paraphrase Retrieval with Smoothed Deep Metric Learning

    Full text link
    The goal of a Question Paraphrase Retrieval (QPR) system is to retrieve equivalent questions that result in the same answer as the original question. Such a system can be used to understand and answer rare and noisy reformulations of common questions by mapping them to a set of canonical forms. This has large-scale applications for community Question Answering (cQA) and open-domain spoken language question answering systems. In this paper we describe a new QPR system implemented as a Neural Information Retrieval (NIR) system consisting of a neural network sentence encoder and an approximate k-Nearest Neighbour index for efficient vector retrieval. We also describe our mechanism to generate an annotated dataset for question paraphrase retrieval experiments automatically from question-answer logs via distant supervision. We show that the standard loss function in NIR, triplet loss, does not perform well with noisy labels. We propose smoothed deep metric loss (SDML) and with our experiments on two QPR datasets we show that it significantly outperforms triplet loss in the noisy label setting

    Learning When Not to Answer: A Ternary Reward Structure for Reinforcement Learning based Question Answering

    Full text link
    In this paper, we investigate the challenges of using reinforcement learning agents for question-answering over knowledge graphs for real-world applications. We examine the performance metrics used by state-of-the-art systems and determine that they are inadequate for such settings. More specifically, they do not evaluate the systems correctly for situations when there is no answer available and thus agents optimized for these metrics are poor at modeling confidence. We introduce a simple new performance metric for evaluating question-answering agents that is more representative of practical usage conditions, and optimize for this metric by extending the binary reward structure used in prior work to a ternary reward structure which also rewards an agent for not answering a question rather than giving an incorrect answer. We show that this can drastically improve the precision of answered questions while only not answering a limited number of previously correctly answered questions. Employing a supervised learning strategy using depth-first-search paths to bootstrap the reinforcement learning algorithm further improves performance.Comment: Accepted at NAACL 2019. Version 1 was presented at NIPS 2018 workshop on Relational Representation Learnin

    Influence of soil fabric on dynamic properties of sand: An experimental study

    Get PDF
    Natural field sites originate under the action of external sources such as rivers, wind, or marine environments. These sources are responsible for constituting a variety of soil fabrics, which ultimately modify the deformation characteristics. Additionally, the dynamic properties of a site present the characterization of a region and have been profusely utilized by field engineers and researchers. In the present study, the dynamic properties of soil specimens have been evaluated for specimen preparation techniques, namely, air pluviation (AP) and water sedimentation (WS). The cyclic triaxial tests were conducted on the strain-controlled consolidated undrained specimens at a frequency of 0.1 Hz. This frequency has been used in several studies to replicate earthquake or liquefaction scenarios. The calculation of shear modulus (G) and damping ratio (D) was performed using symmetric hysteresis loops generated through cyclic loadings. The outcomes suggest that the specimen prepared using the WS technique possesses a larger shear modulus value than AP ones. The reason behind this observation was the lower degradation characteristics of the WS-prepared specimens. Additionally, the liquefaction susceptibility of the specimens has been noticed for different specimens. © 2023 ISEC Press

    Liquefaction proneness of stratified sand-silt layers based on cyclic triaxial tests

    Get PDF
    Most studies on liquefaction have addressed homogeneous soil strata using sand or sand with fine content without considering soil stratification. In this study, cyclic triaxial tests were conducted on the stratified sand specimens embedded with the silt layers to investigate the liquefaction failures and void-redistribution at confining stress of 100 kPa under stress-controlled mode. The loosening of underlying sand mass and hindrance to pore-water flow caused localized bulging at the sand-silt interface. It is observed that at a silt thickness of 0.2H (H is the height of the specimen), nearly 187 load cycles were required to attain liquefaction, which was the highest among all the silt thicknesses with a single silt layer. Therefore, 0.2H is assumed as the optimum silt thickness (topt). The silt was placed at the top, middle and bottom of the specimen to understand the effect of silt layer location. Due to the increase in depth of the silt layer from the top position (capped soil state) to the bottom, the cycles to reach liquefaction (Ncyc,L) increased 2.18 times. Also, when the number of silt layers increased from single to triple, there was an increase of about 880% in Ncyc,L. The micro-characterization analysis of the soil specimens indicated silty materials transported in upper sections of the specimen due to the dissipated pore pressure. The main parameters, including thickness (t), location (z), cyclic stress ratio (CSR), number of silt layers (n) and modified relative density (Dr,m), performed significantly in governing the liquefaction resistance. For this, a multilinear regression model is developed based on critical parameters for prediction of Ncyc,L. Furthermore, the developed constitutive model has been validated using the data from the present study and earlier findings

    Using Pairwise Occurrence Information to Improve Knowledge Graph Completion on Large-Scale Datasets

    Get PDF
    Bilinear models such as DistMult and ComplEx are effective methods for knowledge graph (KG) completion. However, they require large batch sizes, which becomes a performance bottleneck when training on large scale datasets due to memory constraints. In this paper we use occurrences of entity-relation pairs in the dataset to construct a joint learning model and to increase the quality of sampled negatives during training. We show on three standard datasets that when these two techniques are combined, they give a significant improvement in performance, especially when the batch size and the number of generated negative examples are low relative to the size of the dataset. We then apply our techniques to a dataset containing 2 million entities and demonstrate that our model outperforms the baseline by 2.8% absolute on [email protected]: 8 pages, 3 figures, accepted at EMNLP 201

    A HYBRID S N -P N FORMULATION FOR SOLUTION OF THE BOLTZMANN TRANSPORT EQUATION FOR PHONONS

    Get PDF
    ABSTRACT A generalized form of the Ballistic-Diffusive Equations (BDE) for approximate solution of the Boltzmann Transport Equation (BTE) for phonons is formulated. The formulation presented here is new and general in the sense that, unlike previously published formulations of the BDE, it does not require a priori knowledge of the specific heat capacity of the material. Furthermore, it does not introduce artifacts such as media and ballistic temperatures. As a consequence, the boundary conditions have clear physical meaning. In formulating the BDE, the phonon intensity is split into two components: ballistic and diffusive. The ballistic component is traditionally determined using a viewfactor formulation, while the diffusive component is solved by invoking spherical harmonics expansions. Use of the viewfactor approach for the ballistic component is prohibitive for complex large-scale geometries. Instead, in this work, the ballistic equation is solved using two different established methods that are appropriate for use in complex geometries, namely the discrete ordinates method (DOM), and the control angle discrete ordinates method (CADOM). Results of each method for solving the BDE are compared against benchmark Monte Carlo results, as well as solutions of the BTE using standalone DOM and CADOM for a two-dimensional transient heat conduction problem at various Knudsen numbers. It is found that standalone CADOM (for BTE) and hybrid CADOM-P 1 (for BDE) yield the best accuracy. The hybrid CADOM-P 1 is found to be the best method in terms of computational efficiency. NOMENCLATUR
    corecore